Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 215094 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 93.2 MiB |
| Average record size in memory | 454.6 B |
Variable types
| NUM | 8 |
|---|---|
| CAT | 4 |
| BOOL | 1 |
Reproduction
| Analysis started | 2020-03-09 05:51:48.501235 |
|---|---|
| Analysis finished | 2020-03-09 06:01:52.642760 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
회원번호 has a high cardinality: 215094 distinct values | High cardinality |
회원이름 has a high cardinality: 150751 distinct values | High cardinality |
담당자 has a high cardinality: 2157 distinct values | High cardinality |
진행률 is highly correlated with 총불입액 | High Correlation |
총불입액 is highly correlated with 진행률 | High Correlation |
상태 has 3665 (1.7%) zeros | Zeros |
| Distinct count | 215094 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120606.92464224943 |
|---|---|
| Minimum | 0 |
| Maximum | 234612 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12196.65 |
| Q1 | 64479.25 |
| median | 121871.5 |
| Q3 | 178243.75 |
| 95-th percentile | 222708.35 |
| Maximum | 234612 |
| Range | 234612 |
| Interquartile range (IQR) | 113764.5 |
Descriptive statistics
| Standard deviation | 66721.49409 |
|---|---|
| Coefficient of variation (CV) | 0.553214455 |
| Kurtosis | -1.159724153 |
| Mean | 120606.9246 |
| Median Absolute Deviation (MAD) | 57552.93947 |
| Skewness | -0.06307497504 |
| Sum | 2.594182585e+10 |
| Variance | 4451757774 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 11343.5 19639.5 36392.5 53461.5 ... 226652.5 228076.5 228248.5 228347.5 234612. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 213309 | 1 | < 0.1% | |
| 4439 | 1 | < 0.1% | |
| 6486 | 1 | < 0.1% | |
| 341 | 1 | < 0.1% | |
| 2388 | 1 | < 0.1% | |
| 14674 | 1 | < 0.1% | |
| 8529 | 1 | < 0.1% | |
| 10576 | 1 | < 0.1% | |
| 53583 | 1 | < 0.1% | |
| Other values (215084) | 215084 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 234612 | 1 | < 0.1% | |
| 234611 | 1 | < 0.1% | |
| 234610 | 1 | < 0.1% | |
| 234609 | 1 | < 0.1% | |
| 234607 | 1 | < 0.1% |
| Distinct count | 215094 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 211A001321 | 1 |
|---|---|
| 1022A00439 | 1 |
| 218A053829 | 1 |
| 209A034678 | 1 |
| 214A039899 | 1 |
| Other values (215089) |
| Value | Count | Frequency (%) | |
| 211A001321 | 1 | < 0.1% | |
| 1022A00439 | 1 | < 0.1% | |
| 218A053829 | 1 | < 0.1% | |
| 209A034678 | 1 | < 0.1% | |
| 214A039899 | 1 | < 0.1% | |
| 211A005424 | 1 | < 0.1% | |
| 218A037566 | 1 | < 0.1% | |
| 215A011382 | 1 | < 0.1% | |
| 214A007287 | 1 | < 0.1% | |
| 218A048025 | 1 | < 0.1% | |
| Other values (215084) | 215084 | > 99.9% |
Length
| Max length | 11 |
|---|---|
| Mean length | 10.00073456 |
| Min length | 10 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 16 | 57.1% | |
| Decimal_Number | 10 | 35.7% | |
| Dash_Punctuation | 1 | 3.6% | |
| Connector_Punctuation | 1 | 3.6% |
| Value | Count | Frequency (%) | |
| Latin | 16 | 57.1% | |
| Common | 12 | 42.9% |
| Value | Count | Frequency (%) | |
| ASCII | 28 | 100.0% |
| Distinct count | 150751 |
|---|---|
| Unique (%) | 70.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 주식회사에프피에이110111 | 70 |
|---|---|
| 임덕길470529 | 24 |
| 대성건설(주)124-81 | 20 |
| 강성복551227 | 12 |
| 백승식720219 | 12 |
| Other values (150746) |
| Value | Count | Frequency (%) | |
| 주식회사에프피에이110111 | 70 | < 0.1% | |
| 임덕길470529 | 24 | < 0.1% | |
| 대성건설(주)124-81 | 20 | < 0.1% | |
| 강성복551227 | 12 | < 0.1% | |
| 백승식720219 | 12 | < 0.1% | |
| (주)지케이씨교역540605 | 10 | < 0.1% | |
| 김숙경550210 | 9 | < 0.1% | |
| 김현순650213 | 8 | < 0.1% | |
| 이정애740710 | 8 | < 0.1% | |
| 김현영731018 | 8 | < 0.1% | |
| Other values (150741) | 214913 | 99.9% |
Length
| Max length | 25 |
|---|---|
| Mean length | 9.006541326 |
| Min length | 8 |
| Value | Count | Frequency (%) | |
| Other_Letter | 502 | 90.6% | |
| Uppercase_Letter | 26 | 4.7% | |
| Decimal_Number | 10 | 1.8% | |
| Lowercase_Letter | 9 | 1.6% | |
| Other_Punctuation | 3 | 0.5% | |
| Open_Punctuation | 1 | 0.2% | |
| Dash_Punctuation | 1 | 0.2% | |
| Space_Separator | 1 | 0.2% | |
| Close_Punctuation | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| Hangul | 502 | 90.6% | |
| Latin | 35 | 6.3% | |
| Common | 17 | 3.1% |
| Value | Count | Frequency (%) | |
| Hangul | 502 | 90.6% | |
| ASCII | 52 | 9.4% |
주소
Categorical
| Distinct count | 46 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 경기 | |
|---|---|
| 서울 | |
| 인천 | |
| 경상 | 11440 |
| 광주 | 9080 |
| Other values (41) |
| Value | Count | Frequency (%) | |
| 경기 | 59425 | 27.6% | |
| 서울 | 54498 | 25.3% | |
| 인천 | 16745 | 7.8% | |
| 경상 | 11440 | 5.3% | |
| 광주 | 9080 | 4.2% | |
| 부산 | 8535 | 4.0% | |
| 전라 | 8133 | 3.8% | |
| 강원 | 7861 | 3.7% | |
| 충청 | 7778 | 3.6% | |
| 대전 | 6562 | 3.1% | |
| Other values (36) | 25037 | 11.6% |
Length
| Max length | 2 |
|---|---|
| Mean length | 1.999958158 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Other_Letter | 46 | 93.9% | |
| Decimal_Number | 1 | 2.0% | |
| Other_Punctuation | 1 | 2.0% | |
| Space_Separator | 1 | 2.0% |
| Value | Count | Frequency (%) | |
| Hangul | 46 | 93.9% | |
| Common | 3 | 6.1% |
| Value | Count | Frequency (%) | |
| Hangul | 46 | 93.9% | |
| ASCII | 3 | 6.1% |
상품금액
Real number (ℝ)
| Distinct count | 24 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -5.338303537003924e-17 |
|---|---|
| Minimum | -0.7986044124117965 |
| Maximum | 1.5977112022246545 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | -0.7986044124 |
|---|---|
| 5-th percentile | -0.4220822269 |
| Q1 | -0.1827425431 |
| median | 0.1383228864 |
| Q3 | 0.1383228864 |
| 95-th percentile | 0.3134494843 |
| Maximum | 1.597711202 |
| Range | 2.396315615 |
| Interquartile range (IQR) | 0.3210654295 |
Descriptive statistics
| Standard deviation | 0.2108972254 |
|---|---|
| Coefficient of variation (CV) | -3.950641322e+15 |
| Kurtosis | -0.6663673791 |
| Mean | -5.338303537e-17 |
| Median Absolute Deviation (MAD) | 0.1830777386 |
| Skewness | -0.6135871896 |
| Sum | -1.148237061e-11 |
| Variance | 0.04447763968 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.79860441 -0.78401053 -0.62201843 -0.44835122 -0.41916345 ... 0.38875392 0.4465457 0.50492123 1.08167149 1.5977112 ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0.1383228864 | 92906 | 43.2% | |
| 0.05075958742 | 35183 | 16.4% | |
| -0.2994936084 | 31927 | 14.8% | |
| -0.1827425431 | 21790 | 10.1% | |
| 0.3134494843 | 15757 | 7.3% | |
| -0.4220822269 | 13124 | 6.1% | |
| -0.1243670105 | 2357 | 1.1% | |
| -0.03680371153 | 848 | 0.4% | |
| -0.2294429692 | 268 | 0.1% | |
| -0.4746202063 | 234 | 0.1% | |
| Other values (14) | 700 | 0.3% |
| Value | Count | Frequency (%) | |
| -0.7986044124 | 40 | < 0.1% | |
| -0.7694166461 | 3 | < 0.1% | |
| -0.4746202063 | 234 | 0.1% | |
| -0.4220822269 | 13124 | 6.1% | |
| -0.4162446737 | 56 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.597711202 | 10 | < 0.1% | |
| 0.5656317853 | 48 | < 0.1% | |
| 0.5177638485 | 1 | < 0.1% | |
| 0.4920786141 | 78 | < 0.1% | |
| 0.4010127832 | 26 | < 0.1% |
| Distinct count | 935 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -9.72522624563091e-17 |
|---|---|
| Minimum | -0.998736450200167 |
| Maximum | 7.221497364246479 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | -0.9987364502 |
|---|---|
| 5-th percentile | -0.9831526693 |
| Q1 | -0.924187012 |
| median | -0.5956640641 |
| Q3 | 0.01083983987 |
| 95-th percentile | 3.043359359 |
| Maximum | 7.221497364 |
| Range | 8.220233814 |
| Interquartile range (IQR) | 0.9350268519 |
Descriptive statistics
| Standard deviation | 1.364240623 |
|---|---|
| Coefficient of variation (CV) | -1.402785486e+16 |
| Kurtosis | 1.691116703 |
| Mean | -9.725226246e-17 |
| Median Absolute Deviation (MAD) | 1.02395518 |
| Skewness | 1.649109372 |
| Sum | -2.091837814e-11 |
| Variance | 1.861152476 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.99873645 -0.99852586 -0.99789408 -0.99052338 -0.98462681 ... 4.56972752 5.03471384 5.317749 5.72208494 7.22149736], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| -0.9831526693 | 13853 | 6.4% | |
| 3.043359359 | 13538 | 6.3% | |
| -0.949458008 | 8544 | 4.0% | |
| -0.974729004 | 7931 | 3.7% | |
| 2.335771472 | 6289 | 2.9% | |
| -0.9663053387 | 5958 | 2.8% | |
| -0.898916016 | 5126 | 2.4% | |
| -0.848374024 | 5048 | 2.3% | |
| -0.9696748048 | 4548 | 2.1% | |
| -0.696748048 | 3714 | 1.7% | |
| Other values (925) | 140545 | 65.3% |
| Value | Count | Frequency (%) | |
| -0.9987364502 | 137 | 0.1% | |
| -0.9983152669 | 49 | < 0.1% | |
| -0.9974729004 | 6 | < 0.1% | |
| -0.9966305339 | 7 | < 0.1% | |
| -0.9962093506 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 7.221497364 | 1 | < 0.1% | |
| 7.167585906 | 1 | < 0.1% | |
| 7.086718719 | 3 | < 0.1% | |
| 6.817161428 | 1 | < 0.1% | |
| 6.581298799 | 13 | < 0.1% |
해약금액
Real number (ℝ)
| Distinct count | 1824 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.959851466061581e-17 |
|---|---|
| Minimum | -1.0 |
| Maximum | 38.33902623366825 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | -1 |
| Q3 | -1 |
| 95-th percentile | 7.711726823 |
| Maximum | 38.33902623 |
| Range | 39.33902623 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.590347583 |
|---|---|
| Coefficient of variation (CV) | 1.213016134e+17 |
| Kurtosis | 16.95820195 |
| Mean | 2.959851466e-17 |
| Median Absolute Deviation (MAD) | 1.768771693 |
| Skewness | 4.112052435 |
| Sum | 6.366462912e-12 |
| Variance | 12.89059576 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-1. -0.99680294 -0.99299692 -0.99188049 -0.98848045 ... 25.67994885 25.90308313 25.90321 26.44901869 38.33902623], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| -1 | 176137 | 81.9% | |
| -0.8985061243 | 7801 | 3.6% | |
| -0.8477591864 | 2470 | 1.1% | |
| 18.73040944 | 1955 | 0.9% | |
| -0.8173110237 | 1538 | 0.7% | |
| -0.7970122485 | 1431 | 0.7% | |
| 15.26946828 | 1119 | 0.5% | |
| -0.7564146982 | 1099 | 0.5% | |
| -0.715817148 | 481 | 0.2% | |
| -0.6955183728 | 372 | 0.2% | |
| Other values (1814) | 20691 | 9.6% |
| Value | Count | Frequency (%) | |
| -1 | 176137 | 81.9% | |
| -0.9936058858 | 17 | < 0.1% | |
| -0.9923879593 | 120 | 0.1% | |
| -0.9913730206 | 16 | < 0.1% | |
| -0.9898506124 | 33 | < 0.1% |
| Value | Count | Frequency (%) | |
| 38.33902623 | 2 | < 0.1% | |
| 36.18735607 | 1 | < 0.1% | |
| 35.53779526 | 1 | < 0.1% | |
| 31.1056592 | 1 | < 0.1% | |
| 30.96295881 | 1 | < 0.1% |
| Distinct count | 2157 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 더피플라이프 | |
|---|---|
| 금강종합상조(주) | |
| 강대석 | 6336 |
| 김영권 | 4857 |
| 이덕술 | 4749 |
| Other values (2152) |
| Value | Count | Frequency (%) | |
| 더피플라이프 | 72013 | 33.5% | |
| 금강종합상조(주) | 28707 | 13.3% | |
| 강대석 | 6336 | 2.9% | |
| 김영권 | 4857 | 2.3% | |
| 이덕술 | 4749 | 2.2% | |
| 김영경 | 4241 | 2.0% | |
| 제이앤지 | 1848 | 0.9% | |
| 심상열 | 1818 | 0.8% | |
| 고달진 | 1703 | 0.8% | |
| 안미나 | 1489 | 0.7% | |
| Other values (2147) | 87333 | 40.6% |
Length
| Max length | 11 |
|---|---|
| Mean length | 4.831027365 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Other_Letter | 252 | 96.9% | |
| Uppercase_Letter | 4 | 1.5% | |
| Decimal_Number | 2 | 0.8% | |
| Open_Punctuation | 1 | 0.4% | |
| Close_Punctuation | 1 | 0.4% |
| Value | Count | Frequency (%) | |
| Hangul | 252 | 96.9% | |
| Common | 4 | 1.5% | |
| Latin | 4 | 1.5% |
| Value | Count | Frequency (%) | |
| Hangul | 252 | 96.9% | |
| ASCII | 8 | 3.1% |
연체횟수
Real number (ℝ)
| Distinct count | 352 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -1.247365974983095e-16 |
|---|---|
| Minimum | -26.68776495086311 |
| Maximum | 5.838577246426644 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | -26.68776495 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | -1 |
| Q3 | 0.9538792133 |
| 95-th percentile | 4.286967283 |
| Maximum | 5.838577246 |
| Range | 32.5263422 |
| Interquartile range (IQR) | 1.953879213 |
Descriptive statistics
| Standard deviation | 2.133398092 |
|---|---|
| Coefficient of variation (CV) | -1.710322499e+16 |
| Kurtosis | 24.98001465 |
| Mean | -1.247365975e-16 |
| Median Absolute Deviation (MAD) | 1.460643645 |
| Skewness | -2.079096372 |
| Sum | -2.68300937e-11 |
| Variance | 4.551387419 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-26.68776495 -23.15354226 -21.7743334 -20.68245972 -20.51005861 ... 4.71797005 5.29264041 5.69490966 5.80984373 5.83857725], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| -1 | 118228 | 55.0% | |
| -0.9425329643 | 4983 | 2.3% | |
| 4.689236533 | 3586 | 1.7% | |
| -0.8850659286 | 2372 | 1.1% | |
| -0.7126648216 | 1885 | 0.9% | |
| 2.275621034 | 1862 | 0.9% | |
| -0.8275988929 | 1758 | 0.8% | |
| -0.7701318573 | 1688 | 0.8% | |
| 1.873351784 | 1535 | 0.7% | |
| 2.045752891 | 1526 | 0.7% | |
| Other values (342) | 75671 | 35.2% |
| Value | Count | Frequency (%) | |
| -26.68776495 | 1 | < 0.1% | |
| -23.18227577 | 1 | < 0.1% | |
| -23.12480874 | 2 | < 0.1% | |
| -23.0673417 | 2 | < 0.1% | |
| -22.8949406 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5.838577246 | 149 | 0.1% | |
| 5.781110211 | 83 | < 0.1% | |
| 5.723643175 | 60 | < 0.1% | |
| 5.666176139 | 52 | < 0.1% | |
| 5.608709104 | 42 | < 0.1% |
성별
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 113423 | 52.7% | |
| 0 | 101671 | 47.3% |
나이
Real number (ℝ)
| Distinct count | 91 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -5.021176594211611e-18 |
|---|---|
| Minimum | -0.6220244293954389 |
| Maximum | 1.1598604034546345 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | -0.6220244294 |
|---|---|
| 5-th percentile | -0.4420360624 |
| Q1 | -0.172053512 |
| median | 0.007934854945 |
| Q3 | 0.1699243852 |
| 95-th percentile | 0.4219080989 |
| Maximum | 1.159860403 |
| Range | 1.781884833 |
| Interquartile range (IQR) | 0.3419778972 |
Descriptive statistics
| Standard deviation | 0.2560015726 |
|---|---|
| Coefficient of variation (CV) | -5.098437942e+16 |
| Kurtosis | -0.4298260064 |
| Mean | -5.021176594e-18 |
| Median Absolute Deviation (MAD) | 0.2079769809 |
| Skewness | 0.007568506106 |
| Sum | -1.080024958e-12 |
| Variance | 0.06553680519 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.62202443 -0.59502617 -0.57702734 -0.5590285 -0.54102966 ... 0.95287378 0.97087262 1.01586971 1.15086099 1.1598604 ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0.07993020173 | 6518 | 3.0% | |
| 0.06193136503 | 6261 | 2.9% | |
| 0.04393252834 | 6111 | 2.8% | |
| -0.06406049184 | 6033 | 2.8% | |
| -0.01006398175 | 5876 | 2.7% | |
| 0.007934854945 | 5852 | 2.7% | |
| 0.09792903842 | 5840 | 2.7% | |
| -0.04606165514 | 5569 | 2.6% | |
| 0.1159278751 | 5525 | 2.6% | |
| -0.02806281845 | 5511 | 2.6% | |
| Other values (81) | 155998 | 72.5% |
| Value | Count | Frequency (%) | |
| -0.6220244294 | 112 | 0.1% | |
| -0.6040255927 | 182 | 0.1% | |
| -0.586026756 | 313 | 0.1% | |
| -0.5680279193 | 483 | 0.2% | |
| -0.5500290826 | 671 | 0.3% |
| Value | Count | Frequency (%) | |
| 1.159860403 | 24 | < 0.1% | |
| 1.141861567 | 3 | < 0.1% | |
| 1.12386273 | 1 | < 0.1% | |
| 1.105863893 | 1 | < 0.1% | |
| 1.051867383 | 2 | < 0.1% |
| Distinct count | 605 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22027684045492693 |
|---|---|
| Minimum | 0.002564102564102564 |
| Maximum | 1.2512820512820513 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0.002564102564 |
|---|---|
| 5-th percentile | 0.002777777778 |
| Q1 | 0.012 |
| median | 0.06923076923 |
| Q3 | 0.1722222222 |
| 95-th percentile | 1 |
| Maximum | 1.251282051 |
| Range | 1.248717949 |
| Interquartile range (IQR) | 0.1602222222 |
Descriptive statistics
| Standard deviation | 0.3297116756 |
|---|---|
| Coefficient of variation (CV) | 1.496805905 |
| Kurtosis | 1.024747933 |
| Mean | 0.2202768405 |
| Median Absolute Deviation (MAD) | 0.2521903989 |
| Skewness | 1.612337493 |
| Sum | 47380.22672 |
| Variance | 0.108709789 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0025641 0.00267094 0.00305556 0.00356061 0.00381702 ... 0.98666667 0.99083333 0.99583333 1.00357143 1.25128205], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 1 | 23750 | 11.0% | |
| 0.002564102564 | 8753 | 4.1% | |
| 0.003846153846 | 7785 | 3.6% | |
| 0.007692307692 | 7232 | 3.4% | |
| 0.002777777778 | 5097 | 2.4% | |
| 0.004 | 4548 | 2.1% | |
| 0.01 | 4170 | 1.9% | |
| 0.01538461538 | 4131 | 1.9% | |
| 0.008 | 3407 | 1.6% | |
| 0.02307692308 | 3387 | 1.6% | |
| Other values (595) | 142834 | 66.4% |
| Value | Count | Frequency (%) | |
| 0.002564102564 | 8753 | 4.1% | |
| 0.002777777778 | 5097 | 2.4% | |
| 0.003333333333 | 8 | < 0.1% | |
| 0.003787878788 | 49 | < 0.1% | |
| 0.003846153846 | 7785 | 3.6% |
| Value | Count | Frequency (%) | |
| 1.251282051 | 1 | < 0.1% | |
| 1.01 | 7 | < 0.1% | |
| 1.007142857 | 1 | < 0.1% | |
| 1 | 23750 | 11.0% | |
| 0.9916666667 | 18 | < 0.1% |
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1685216695956187 |
|---|---|
| Minimum | 0 |
| Maximum | 4 |
| Zeros | 3665 |
| Zeros (%) | 1.7% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 4 |
| Range | 4 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.063133396 |
|---|---|
| Coefficient of variation (CV) | 0.490257216 |
| Kurtosis | -1.500647419 |
| Mean | 2.16852167 |
| Median Absolute Deviation (MAD) | 1.008188777 |
| Skewness | -0.1720879326 |
| Sum | 466436 |
| Variance | 1.130252619 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 1.5 2.5 3.5 4. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 3 | 106467 | 49.5% | |
| 1 | 84623 | 39.3% | |
| 4 | 10867 | 5.1% | |
| 2 | 9472 | 4.4% | |
| 0 | 3665 | 1.7% |
| Value | Count | Frequency (%) | |
| 0 | 3665 | 1.7% | |
| 1 | 84623 | 39.3% | |
| 2 | 9472 | 4.4% | |
| 3 | 106467 | 49.5% | |
| 4 | 10867 | 5.1% |
| Value | Count | Frequency (%) | |
| 4 | 10867 | 5.1% | |
| 3 | 106467 | 49.5% | |
| 2 | 9472 | 4.4% | |
| 1 | 84623 | 39.3% | |
| 0 | 3665 | 1.7% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
| df_index | 회원번호 | 회원이름 | 주소 | 상품금액 | 총불입액 | 해약금액 | 담당자 | 연체횟수 | 성별 | 나이 | 진행률 | 상태 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0022A00001 | 이옥성590318 | 경기 | -0.299494 | 3.043359 | 18.730409 | 더피플라이프 | -1.000000 | 0 | 0.097929 | 1.00 | 0 |
| 1 | 1 | 0072A00001 | 안성열581125 | 경기 | -0.299494 | 3.043359 | -1.000000 | 더피플라이프 | -1.000000 | 0 | 0.115928 | 1.00 | 2 |
| 2 | 2 | 0072A00002 | 배준택831121 | 부산 | -0.299494 | 1.749484 | -1.000000 | 더피플라이프 | 0.838945 | 0 | -0.334043 | 0.68 | 3 |
| 3 | 3 | 0072A00003 | 배민규821023 | 울산 | -0.299494 | 3.043359 | -1.000000 | 더피플라이프 | -1.000000 | 0 | -0.316044 | 1.00 | 2 |
| 4 | 4 | 0072A00006 | 최금순340728 | 경기 | -0.299494 | 3.043359 | -1.000000 | 더피플라이프 | -1.000000 | 1 | 0.547900 | 1.00 | 2 |
| 5 | 5 | 0072A00007 | 주병오520206 | 서울 | -0.299494 | 1.830352 | 11.301058 | 더피플라이프 | 0.724011 | 0 | 0.223921 | 0.70 | 1 |
| 6 | 6 | 0072A00021 | 정성제760210 | 서울 | -0.299494 | 3.043359 | -1.000000 | 더피플라이프 | -1.000000 | 0 | -0.208051 | 1.00 | 4 |
| 7 | 7 | 0072A00022 | 신영주541016 | 충청 | -0.299494 | 3.043359 | -1.000000 | 더피플라이프 | -1.000000 | 1 | 0.187923 | 1.00 | 4 |
| 8 | 8 | 0072A00026 | 윤일선521001 | 서울 | -0.299494 | 2.881625 | 17.938757 | 더피플라이프 | -0.770132 | 1 | 0.223921 | 0.96 | 1 |
| 9 | 9 | 0072A00027 | 김건용530815 | 서울 | -0.299494 | 3.043359 | -1.000000 | 더피플라이프 | -1.000000 | 1 | 0.205922 | 1.00 | 4 |
Last rows
| df_index | 회원번호 | 회원이름 | 주소 | 상품금액 | 총불입액 | 해약금액 | 담당자 | 연체횟수 | 성별 | 나이 | 진행률 | 상태 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 215084 | 234600 | U022A21089 | 백쌍순320814 | 부산 | -0.299494 | 1.345148 | 8.256241 | 더피플라이프 | 1.413615 | 1 | 0.583898 | 0.580000 | 1 |
| 215085 | 234601 | U022A21090 | 손희락520228 | 부산 | -0.299494 | 2.355988 | 15.370962 | 더피플라이프 | -0.023060 | 0 | 0.223921 | 0.830000 | 1 |
| 215086 | 234602 | U022A21305 | 길준분660312 | 부산 | -0.299494 | 1.183414 | 7.403693 | 더피플라이프 | 1.643484 | 1 | -0.028063 | 0.540000 | 1 |
| 215087 | 234605 | U022A21379 | 김관수370127 | 경남 | -0.299494 | -0.191328 | -1.000000 | 더피플라이프 | 3.597363 | 0 | 0.493903 | 0.200000 | 1 |
| 215088 | 234606 | U022A22154 | 김순옥561026 | 부산 | -0.299494 | -0.878699 | -1.000000 | 더피플라이프 | 4.574302 | 1 | 0.151926 | 0.030000 | 1 |
| 215089 | 234607 | U022A22155 | 조길찬761012 | 부산 | -0.299494 | -0.878699 | -1.000000 | 더피플라이프 | 4.574302 | 0 | -0.208051 | 0.030000 | 1 |
| 215090 | 234609 | U244A00803 | 배상호820807 | 부산 | -0.299494 | 1.897741 | 11.585241 | 더피플라이프 | -0.023060 | 0 | -0.316044 | 0.716667 | 1 |
| 215091 | 234610 | U244A00804 | 배규태830402 | 부산 | -0.299494 | 3.043359 | -1.000000 | 더피플라이프 | -1.000000 | 0 | -0.334043 | 1.000000 | 4 |
| 215092 | 234611 | U244A00805 | 김군자490809 | 부산 | -0.299494 | -0.932611 | -1.000000 | 더피플라이프 | 2.390555 | 1 | 0.277917 | 0.016667 | 1 |
| 215093 | 234612 | U244A00806 | 정일선540801 | 부산 | -0.299494 | -0.191328 | -1.000000 | 더피플라이프 | 1.758418 | 1 | 0.187923 | 0.200000 | 1 |